
66 research outputs found

Randomized cache placement for eliminating conflicts

Author: González Colás Antonio María
Topham Nigel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/1999

Applications with regular patterns of memory access can experience high levels of cache conflict misses. In shared-memory multiprocessors, conflict misses can be increased significantly by the data transpositions required for parallelization. Techniques such as blocking, which are introduced within a single thread to improve locality, can result in yet more conflict misses. The tension between minimizing cache conflicts and the other transformations needed for efficient parallelization leads to complex optimization problems for parallelizing compilers. This paper shows how the introduction of a pseudorandom element into the cache index function can effectively eliminate repetitive conflict misses and produce a cache whose miss ratio depends solely on working set behavior. We examine the impact of pseudorandom cache indexing on processor cycle times and present practical solutions to some of the major implementation issues for this type of cache. Our conclusions are supported by simulations of a superscalar out-of-order processor executing the SPEC95 benchmarks, as well as by cache simulations of individual loop kernels to illustrate specific effects. We present measurements of instructions committed per cycle (IPC) when comparing the performance of different cache architectures on whole-program benchmarks such as the SPEC95 suite. Peer Reviewed. Postprint (published version)
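
The effect described in the abstract above can be illustrated with a small sketch. It compares conventional modulo indexing with a simple XOR-folding index in a direct-mapped cache. The XOR fold is only a stand-in chosen for brevity and is not claimed to be the index function used in the paper; the function names, cache size, and access trace are all hypothetical.

```python
def modulo_index(block_addr, num_sets):
    """Conventional placement: use the low-order bits of the block address."""
    return block_addr % num_sets


def xor_fold_index(block_addr, num_sets):
    """Illustrative pseudorandom placement (an assumption, not the paper's
    function): XOR the low index bits with the next group of higher bits.
    num_sets must be a power of two."""
    bits = num_sets.bit_length() - 1
    mask = num_sets - 1
    return (block_addr ^ (block_addr >> bits)) & mask


def count_misses(block_addrs, num_sets, index_fn):
    """Simulate a direct-mapped cache and return the number of misses."""
    lines = [None] * num_sets  # each set holds the block address it caches
    misses = 0
    for addr in block_addrs:
        s = index_fn(addr, num_sets)
        if lines[s] != addr:
            misses += 1
            lines[s] = addr
    return misses


NUM_SETS = 64
# Hypothetical regular access pattern: 8 blocks spaced exactly NUM_SETS
# apart, swept 10 times. Under modulo indexing they all share set 0.
trace = [i * NUM_SETS for i in range(8)] * 10

modulo_misses = count_misses(trace, NUM_SETS, modulo_index)
xor_misses = count_misses(trace, NUM_SETS, xor_fold_index)
print(modulo_misses, xor_misses)
```

With this trace, every access misses under modulo indexing, while the XOR-folded index spreads the eight blocks over distinct sets so that only the first sweep misses. Other strides may behave differently under so simple a hash.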

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Run-time integrity monitoring of untrustworthy analog front-ends

Author: Salem Heba
Topham Nigel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 02/06/2023

Edinburgh Research Explorer

Trustworthy computing on untrustworthy and Trojan-infected on-chip interconnects

Author: Salem Heba
Topham Nigel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 29/06/2021

Edinburgh Research Explorer

Detecting denial-of-service hardware Trojans in DRAM-based memory systems

Author: Salem Heba
Topham Nigel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 10/01/2022

Edinburgh Research Explorer

Performance of the decoupled ACRI-1 architecture: The perfect club

Author: McDougall Kenneth
Topham Nigel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1995

Crossref

Edinburgh Research Explorer

The Smart Cache: An Energy-Efficient Cache Architecture Through Dynamic Adaptation

Author: Jones Timothy M.
Sundararajan Karthik T.
Topham Nigel P.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2013

Edinburgh Research Explorer

Performance of Weak Consistency Schemes on the DEC Alpha

Author: Harris T.
Topham Nigel P.
Publication venue
Publication date: 01/01/1993

Edinburgh Research Explorer

Poise: Balancing Thread-Level Parallelism and Memory System Performance in GPUs using Machine Learning

Author: Dublish Saumay
Nagarajan Vijayanand
Topham Nigel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/03/2019

Crossref

Edinburgh Research Explorer

Cycle-accurate performance modelling in an ultra-fast just-in-time dynamic binary translation instruction set simulator

Author: Bohm Igor
Franke Bjoern
Topham Nigel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2010

Abstract. Instruction set simulators (ISS) are vital tools for compiler and processor architecture design space exploration and verification. State-of-the-art simulators using just-in-time (JIT) dynamic binary translation (DBT) techniques are able to simulate complex embedded processors at speeds above 500 MIPS. However, these functional ISS do not provide microarchitectural observability. In contrast, low-level cycle-accurate ISS are too slow to simulate full-scale applications, forcing developers to revert to FPGA-based simulations. In this paper we demonstrate that it is possible to run ultra-high speed cycle-accurate instruction set simulations surpassing FPGA-based simulation speeds. We extend the JIT DBT engine of our ISS and augment JIT-generated code with a verified cycle-accurate processor model. Our approach can model any microarchitectural configuration, does not rely on prior profiling, instrumentation, or compilation, and works for all binaries targeting a state-of-the-art embedded processor implementing the ARCompact™ instruction set architecture (ISA). We achieve simulation speeds up to 88 MIPS on a standard x86 desktop computer for the industry standard EEMBC, COREMARK and BIOPERF benchmark suites.

CiteSeerX

Crossref

Edinburgh Research Explorer

Cooperative Caching for GPUs

Author: Dublish Saumay
Nagarajan Vijay
Topham Nigel
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/12/2016

Edinburgh Research Explorer